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(54) Providing secure access through network firewalls 



(57) To enable controlled, secure connections using 
a versatile protocol such as TCP/IP to be established 
through a firewall and proxy server, the versatile proto- 
col is tunnelled using HTTP. Client to server communi- 
cations are effected using an HTTP POST operation. 
Server to client communications are effected using an 
HTTP GET operation to establish a tunnelled socket; 



this socket is closed within an interval less than any 
timeout imposed by the proxy server and immediately 
re-established by another GET operation, irrespective 
of whether data continue to be pending for communica- 
tion to the client. A globally-unique ID is included in each 
POST and GET message to enable related messages 
to be recognized. 
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Description 

Technical Field 

s [0001] This invention relates to methods and apparatus for providing secure access between a service external to 
a network firewall (such as a web page server) and a client internal to the firewall (such as a web browser). 

Background Art 

10 [0002] Facilities for global access to information, such as the World Wide Web, are undergoing immense growth and 
expansion, both in volume of data transferred and in sophistication of tools, such as web browsers, for accessing those 
data. This in turn requires the flexibility and capability of the data network infrastructure to be increased, without at the 
same time jeopardising its security features. Often these objectives conflict. 

[0003] Web based applications are "client-server - in nature. A client-based browser runs on an end user's computer, 
communicating via network links with a web server operated by a service provider. The server provides data defining 
the content of web pages which are rendered and displayed by the user's browser; typically the web page content is 
defined using a markup language such as Hypertext Markup Language (HTML). The communications between the 
browser and the web server are conducted in accordance with a protocol called Hypertext Transfer Protocol (HTTP), 
defined in the Internet Engineering Task Force's RFC 1 945. HTTP is a simple text-based protocol that has the peculiar 

20 quality that messages conforming to it are trusted and allowed to pass around the internets, intranets and extranets 
that make up today's "Internet". This Internet is really a series of closely coupled networks linked together through 
■firewalls": network nodes that allow controlled, restricted access between two networks. However the ability to "browse 
the world wide web" is seen as a universal common denominator, and as such HTTP messages are allowed to pass 
through these firewalls unchecked. There have been a number of enhancements to the HTTP protocol: the description 

25 herein relates by way of example to the basic version, HTTP/1 .0, which is supported universally and with which later 
versions of HTTP are backwards compatible. Nonetheless the invention is not limited to use with HTTP/1 .0 or indeed 
any other specific version of HTTP, and the claims hereof should be construed accordingly. 

[0004] Web pages defined using the markup language can be enhanced by the use of code written in the Java 
programming language; this allows dynamic content and interactivity to be added to web pages. Java components can 
30 run either on the server end of the network connection, as "servlets", or on the client browser machine, in which case 
they are known as "applets". Many potential security problems were envisaged with the introduction of applets, such 
as "viruses" and Irojan horses", so a tight set of security restrictions were imposed on what applets could and could 
not do. To this end applets are executed on the client machine in a controlled environment called a "sandbox". This 
sandbox defines how the applet can interact with the resources available in the computational platform the applet is 
35 running on, via a limited application programming interface (API). For example, the applet typically cannot interact with 
the local disk, nor connect to other computers on the network in unrestricted fashion. However the applet can typically 
connect back to the web server it was served from, although by way of an HTTP connection only. References to Java 
herein relate by way of example to Java/1 .1 which is widely deployed; later versions of Java provide enhanced network 
support, but are backwards compatible with this base version. 
40 [0005] The first generation of web content was mainly very static in nature - like pages from a magazine with text 
and pictures. The second generation of web content became increasingly dynamic, providing a user interface for ap- 
plications, such as database queries. The third generation of web content is becoming increasingly interactive, with 
real-time communications, such as video, text chat and Internet Telephony, being added as an integral part. Internet- 
based client-server applications normally operate by opening "sockets" between the client and server, using Transac- 
ts tion Control Protocol/Internet Protocol (TCP/IP). TCP/IP sockets provide bi-directional, reliable communications paths. 
These can be used to implement dynamic or interactive client-server based applications, such as are required by the 
second and third generations of web content. 

[0006] However, these more sophisticated applications pose a problem, in that the communications protocols they 
often require for interaction with the web server, such as TCP/IP, are intentionally barred by firewalls and proxy servers. 
50 it is therefore an object of this invention to provide clients, such as Java applets in a sandbox, with some controlled 
ability (within the context of a specific web-based application) to interact through a firewall with other resources, such 
as web servers, using protocols in addition to HTTP. 

Disclosure of Invention 

55 

[0007] According to one aspect of this invention there is provided a method of permitting secure access between a 
service external to a network firewall and a client internal to the firewall, comprising the steps of: 
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(a) effecting an HTTP GET operation or equivalent thereof (which may be in a protocol functionally equivalent to 
HTTP) from the client to establish a communications socket for communicating data from the service to the client; 

(b) after a predetermined interval effecting another GET operation or equivalent thereof to close the communica- 
tions socket, irrespective of whether access between the service and the client is required to continue; and 

s - repeating steps (a) and (b) while access between the service and the client is required to continue. 

[0008] The HTTP Tunneling Socket described herein as an example of this invention is a socket that operates on 
top of the HTTP protocol. For interaction with a Java-based application it provides the standard APIs as defined for a 
socket in the Java/1.1 API specification. For interaction with the network it uses the HTTP/1 .0 protocol. This technique 

10 in which one protocol is carried in messages conveyed over another protocol is known as tunneling. The use of tunneling 
has two benefits. It allows the tunnel to be executed in the browser sandbox, since the APIs for accessing an HTTP 
connection to a web server do not have any security restrictions. Secondly it allows the client-server connection to 
traverse firewalls between intranets and internets, as HTTP is the protocol that all firewalls allow to pass through them. 
[0009] The Java applet and the server are thus provided with a connection protocol (such as TCP/IP) which is more 

*5 versatile and capable than the HTTP connection to which they would normally be limited by the firewall and/or proxy 
server. However, the restrictions imposed by the sandbox prevent malicious or inappropriate access to the client ma- 
chine's resources through the tunnelled protocol. 



Brief Description of Drawings 

20 

[0010] A method and apparatus in accordance with this invention, for permitting secure access between a client 
behind a firewall and a service provided by a node (such as a server) external to the firewall, will now be described, 
by way of example, with reference to the accompanying drawings, in which: 



25 Figure 1 is a schematic illustration of circumstances in which the invention can be used; 

Figure 2 shows an outline of an implementation of the invention in the Java programming language; 

Figure 3 illustrates the basic principles of operation of the invention; 

Figure 4 shows in more detail the mechanism for communications from the client to the server; and 

Figure 5 shows in more detail the mechanism for communications from the server to the client. 
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Best Mode for Carrying Out the Invention, & Industrial Applicability 



[001 1] There are in principle two aspects involving firewalls to be considered when establishing a connection between 
an applet in a user's terminal and a web server. The first aspect is the case of a firewall-protected web server to which 
35 a browser running on a machine outside the firewall is trying to connect. This aspect does not present a serious problem 
since the web-server owner can configure the firewall to allow this connection to be made. 

[0012] The second aspect is the case of a web browser machine in a corporate intranet trying to connect through 
the corporate firewall to a web server on the public Internet, as illustrated in Figure 1 . The problem here is that the web 
server owner has, by design, no control over the firewall. The complete scenario is that the browser 10 running java 
40 applet code on the user's terminal will connect initially to a machine 1 2 called a proxy within the corporate intranet 14. 
This proxy makes the required connections through the firewall 16 to the internet-based web server 18 on behalf of 
the browser 10, without exposing sensitive information about the browser and its host terminal to view in the public 
internet. The proxy 12 also performs various security checks, for example to ensure that the requests are valid HTTP 
requests. 

45 [001 3] However, because of the restriction to the use of HTTP, the web server is unable to provide desirable services 
to the client, such as real-time audio, video or interactive chat, which are possible with the use of more powerful 
protocols such as TCP/IP. These more powerful protocols are normally barred from unrestricted use because of the 
threat they pose of enabling inappropriate or intrusive access to resources within the corporate intranet. 
[0014] This problem is overcome by using the invention to establish a "tunnel" through the firewall, carried by the 

50 HTTP messages, and thus open communications sockets which the Java applet and the web server can use to com- 
municate with one another. In this way the web server is enabled to use resources on the browser's host machine, 
albeit still within the tight constraints imposed by the Java sandbox. 

[001 5] The tunnel consists of two Java programming classes, one (called the TunnelSocket) which implements the 
"Socket" class as defined in the Java/1.1 specification, and another (called TunneiServerSocket) which implements 
55 the "Serve rSocket" class defined in that specification. As shown in Figure 2, client-based Java code can access the 
standard APIs (such as open, read, write, close, etc.) provided by a class conforming to the Socket class definition to 
communicate with a another Java application, such as one running on the web server. The server-based Java appli- 
cation can likewise use the same Socket APIs, plus the standard APIs such as accept etc. provided for servlets by the 
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ServerSocket class, to communicate with the client-based Java applet. Messages between the Socket class and the 
ServerSocket class are carried within HTTP messages which traverse the firewall between the client and the server. 
[0016] The HTTP tunnel uses the underlying simple text-based HTTP protocol to produce a reliable connection- 
based protocol between the TunnelSocket and TunnelServerSocket classes. Basic HTTP transactions use one of two 

s methods in the HTTP request message. The GET method is designed to "get" a web page from a web server and the 
POST method is used to send data to a remote web resource or web application. In implementing the present invention, 
a GET based protocol is used for the client to server direction, and a POST based protocol is used for the reverse, 
server to client, direction, as shown in Figure 3. In the event that HTTP is supplanted by a functionally equivalent 
protocol, it is envisaged that the invention could be used with operations in that protocol equivalent to the HTTP GET 

10 and POST operations. 

[0017] A single connection between the TunnelSocket and TunnelServerSocket classes is likely to comprise multiple 
HTTP transactions. To associate these transactions together, a numeric Globally Unique ID (GUID) is generated using 
a combination of random numbers, network address and time of day, so that each GUID generated is a unique quantity 
both in relation to time and location in the network. This GUID is incorporated in each HTTP request, both GET and 
is POST, so that the HTTP protocol can recognize that they relate to the same communications socket. Specifically, the 
GUID is passed as the HTTP Uniform Resource Indicator (URI) (the field used to specify a document name in normal 
use of HTTP). 

[0018] Client to server connection is implemented using the POST HTTP operation. Each time a write is performed 
to the client socket (TunnelSocket class) the request is packaged up and sent as the pay load of an HTTP POST 
20 message, including the relevant GUID, as shown in Figure 4. Each such write operation sends a separate POST 
message through the firewall. 

[0019] The server to client direction is more complex because of restrictions imposed by the proxy server 12 and 
the firewall 16. As shown in Figure 5, the client issues an HTTP GET message which tells the server that a new tunneled 
socket is being established. The HTTP GET request will generate a temporary server-client socket that allows data to 

25 be written from the server to the client. However as a security precaution most proxy server implementations will close 
down such a server-client socket after a time interval P, typically of the order often minutes in duration. Accordingly 
the tunnel established in this invention periodically itself forces a pre-emptive close down of the connection at some 
periodicity T which is shorter in duration than the interval P. This is accomplished by issuing another GET request, to 
force the TunnelServerSocket class to close the existing socket to the client and immediately establish a new socket. 

30 it should be noted that this closure (and subsequent reopening) are performed even though data are still pending to 
be transferred from the server to the client. This approach avoids the overhead of previous techniques, in which the 
client polls the server at some preselected polling interval, usually of the order of 20 milliseconds. These frequent polls 
are equivalent to multiple web page requests, so the server receives a large number of web hits per second, imposing 
a significant overhead and severely restricting the scalability of such prior techniques to handle large numbers of clients 

35 simultaneously. 

[0020] An illustration of the implementation of the invention is provided below, in the form of pseudo-code, which 
emphasizes the significant aspects of the implementation but for the sake of clarity omits minor details which, although 
required in practice, are well within the capability of a person skilled in the art. Likewise the existence is assumed of 
certain subsidiary functions, such as guidFactory for creating a GUID and createFiFolnputStream for reserving and 
40 configuring memory for use as a first-in-first-out (FIFO) buffer; again these functions are individually well known in the 
art, so their internal details do not need to be specified. Depending on the particular manner of implementation, addi- 
tional parameters may be included, for example, in some function calls; this is indicated by an ellipsis (...). 
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1. Server to Client direction. 

la. TunnelSocket InputStream Code - runs on client 
// 

// Open an Input Stream to the Server 

// This is the Server to Client direction of the Tunnel Socket, 
// 

// aerverAdressB - Internet address e.g. fred.hpl.hp.com: 
// standard socket 

// Port number is a port number on the server machine to 
// connect to: standard socket 
15 public open (server Address, portNuaber) { 

// Generate a new Globally unique id for the new socket. 
// The id is unique across the whole network and across 
// time. 

20 //It is generated using a random number generator, the 

// node's network address and the current time, 
guid a guidJactory ( ) i 

// Create a Fifo stream that is used by the application code 
25 //to read data. 

// The underlying protocol reads data off the wire into this 

// Fifo ready for the application. 

t ifolnputstream m createPif olnputStreamO ; 

30 

II Issue an HTTP/1.0 GET request to the server 
// The on-the-wire format is: 
// GET <guid>? commandoes tablish 
// Content -Type=*applicat ion/octet-stream 
InputStream » issueHttpGetRequest (uri»guid, 
que ryString= "commands establish" , 
Content - Type»~appl ication/oc tet - a tream" , 
address ^ aerverAddresB, portoport) ; 

40 II Start a timer thread that is invoked at a periodicity of 

// GETTitneOut seconds. 

// Typically GETTimeOut is a few minutes in duration - but 
// it must be less than the time out period of the proxy. 
// The function switchGETStreamO is called at the 
45 If periodicity with its own thread of execution. 

startPeriodicTimeOutDemon(awltchGBT3treejn() , GBTTimeOut) ; 



35 



If send a new request to the server telling it to close down 
// the current stream and establish a new stream 
// periodically 
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private switchGETStreamO { 
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30 
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// Lock the Input Stream to prevent any race conditions with 
/ / readlnputStream 
lock (input StreamLock) 

newlnput Stream » lssueBttpGetRequest (uri«guid, 
query S tr ing» "command** awi tch" , 
Content -Types *appl ica tion/oc tet- stream" , 
address** server Address, port-port) # 



// Drain the contents of the stream until it is closed by 
// the remote end 
15 while (read(Input3tream, 71 f OS t ream) i« eof) ; 

writeToFifo(FifoInputStraaa) » 

inputs tr earn => newInputStreamO * 
unlock (inputs treamLock) > 



return (> ; 



// Read data from the stream via the Fifo - the Fifo buffers 
// data up ready to be read 
public readlnputStream ( , buf, len) { 

// if there isn't enough data in the Fifo to satisfy this 
// read request - top it up from the wire. 
iC (! Fl f oS tream. length O > len) { 

//Transfer data from the stream established to the Server 
lock (input StreamLock) ; 

writeToFlf o (Fif olnputStream, read ( Inputstream, 

len - Fif oS tream. length () ) i 
unlock (inputs treamLock) > 

} 

// return data in from Fifo 

return (read (FlfolnputStream, - - - , buf, len) ) ) i 



public close ( ) { 



// Stop timer thread 

s topper iodicTimeOutDemon(BwitchGBTSt ream <) ) % 



55 



6 



EP1 081 918 A2 



// Mark stream as closed 
marklnputStreamClosedO t 
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lb. TunnelServerSocket OutputStream - runs on the server 



// Server side Protocol engine handling all requests the server receives 

// sit waiting on a ServerSocket for socket connections to come 
// in _ just standard socket call 
while ((socket ■ accept () still open) { 

httpheader = readHttpHeader (socket) t 

if ( httpHeader. type » "GST" UU 

httpHeader. command o establish) { 

// Mew socket request 
// Open an output Stream 

outputs tream *= socket .get Outputs tre am () t 

// Add an entry to the database of opened Tunnel Sockets 
addBntryTunnelSocketHashDataBase (socket, outputs tream, 
httpHeader. guid) 7 



} else ( httpHeader - type «» ~GBT" && 
httpHeader. command ^switch") { 

// Lookup tunnel Socket 
tunnelsocket » 

lookupEntryTonnelSocketDatabase ( httpHeader . guid) 9 

if Request to switch stream generated by the TunnelSocket 

// timer on the client 

// Close the current output stream 

// Take lock to avoid conflict with 

// tunnelOutputStreamWrite 

lock (tunnelsocket * outputs tream&ock) ; 

close (tunnelsocket ♦ outputs tream) ; 

// establish a new output stream on this new connection, 
tunnelsocket .outputs tream =» socket, ge tOutputStr earn () % 
unlock ( tunnelsocket . outputs treamLock) 
} else (httpheader . type *POST* ) 

// See pseudo-code fragment included below with Server- 
// client direction. 
doPOSTO ; 

} 

tunnelOutputStreamWrite (buf , len) { 

// Take lock to avoid conflict with "switch" in 
/ / tunnel ServerAccept ( ) 
lock (outputStreamitock) 



8 



EP 1 081 918 A2 



write (output Stream, buf, leu) > 
unlock ( outputs treamLock) 
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2. Client-Server direction. 

2a. Client side TunnelSocket OutputStream 

public write (buf, length, ...) { 

// Issue an HTTP/1.0 POST request to the server 
// The on the wire format is : 
// POST <guid> 

// Content-Type=application/octet-stream 
// Content -Length** <length> 

inputStream = iaaueHttpPOSTRequeat (urioguid. 
Con ten t - Length- leng tlx , 

Content -Type- *»pplicat ion/oc te t- a tream" , 
address^aerverAddress, port^port, payload=bu£) ; 



2b. Server side TunnelS ervcrSockct InputStream 
public readServerSidelnputSt ream (buf, len, — ) { 

return (readFromFif o (Fif oServerSidelnputStream, buf, len, • . ) i 

} 



//On receiving a POST request (see lb above) 

// copy the payload into the Fifo buffer ready for the 

// readServerSide InputStream to read it. 

doPOSTO { 

writeToFifo (Fif oServerSidelnputStream, 

httpReques t - readPayLoad ( ) , httpHeader . contentLength) ; 

} 



Claims 

1 . A method of permitting secure access between a service external to a network firewall and a client internal to the 
firewall, comprising the steps of: 
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(a) effecting an HTTP GET operation or equivalent thereof from the client to establish a communications socket 
for communicating data from the service to the client; 

(b) after a predetermined interval effecting another GET operation or equivalent thereof to close the commu- 
nications socket, irrespective of whether access between the service and the client is required to continue; and 
repeating steps (a) and (b) while access between the service and the client is required to continue. 

The method of claim 1 , wherein the predetermined interval is less than another interval after which client software 
enforces termination of a communications socket established by a GET operation or equivalent thereof. 

The method of claim 1 or claim 2, wherein information is transferred from the client to the service by an HTTP 
POST operation or equivalent thereof. 

The method of any one of the preceding claims, wherein successive messages transferred between the client and 
the service are identified by a globally-unique identification created by the client and communicated to the service. 

The method of claim 4, wherein the globally-unique identification is communicated via an HTTP GET or POST 
operation or equivalent thereof. 

The method of claim 5, wherein the globally-unique identification is communicated in a URI relative path compo- 
nent. 

The method of any one of the preceding claims, wherein communications with the client traverse a proxy service 
located on the client side of the firewall. 
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